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Abstract — In this paper, we introduce a time-stampless adap- 
tive nonuniform sampling (TANS) framework, in which time 
increments between samples are determined by a function of 
the m most recent increments and sample values. Since only 
past samples are used in computing time increments, it is not 
necessary to save sampling times (time stamps) for use in the 
reconstruction process. We focus on two TANS schemes for 
discrete-time stochastic signals: a greedy method, and a method 
based on dynamic programming. We analyze the performances 
of these schemes by computing (or bounding) their trade-offs 
between sampling rate and expected reconstruction distortion 
for autoregressive and Markovian signals. Simulation results 
support the analysis of the sampling schemes. We show that by 
opportunistically adapting to local signal characteristics TANS 
may lead to improved power efficiency in some applications. 



I. Introduction 

Sampling is essential in any digital system that interfaces 
with the analog world. All else being equal, it is desirable 
to minimize the number of samples while maintaining an 
acceptable reconstruction distortion. In some applications, 
minimizing the number of samples can be translated into 
having a power-efficient sampling, since the power consump- 
tion at an analog-to-digital converter (ADC) is approximately 
proportional to its sampling rate Also, having fewer 
samples can increase the efficiency of other processing of 
these measurements. For example, if these samples should be 
transmitted to another place via a communication channel, 
having fewer samples will improve power and bandwidth 
efficiencies. 

One can view sampling as a query to obtain information 
from a signal or a function that can only be measured remotely. 
We get a sample of this function at an arbitrary time when we 
query for it over a communication medium. Here, a portion of 
the operational cost (e.g., power) is proportional to the number 
of samples that we acquire. Hence, again it is desirable to 
minimize the number of samples taken. 

A uniform sampling at the Nyquist rate of the signal 
may cause some redundant samples, since the global signal 
bandwidth may not be a good measure of local variations of 
the signal. Although traditional nonuniform sampling schemes 
(e.g., |2|j El) deal with this problem, they have certain 
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Fig. 1. A schematic view of the TANS framework: sampling times are 
determined by a function of m most recently taken samples. Hence, it is not 
necessary to save sampling times (time stamps) for use in the reconstruction 
process. 



limitations. Firstly, they are mostly designed to operate un- 
der specific conditions for restrictive signal models (e.g., 
Pl-llSl) and, secondly, sampling times (i.e., time stamps) 
must be stored/transmitted to be used in the reconstruction 
process. This may cause power/bandwidth inefficiencies in 
sampling/communication procedures. 

In this paper, we introduce a new framework for an adaptive 
nonuniform sampling scheme (see Figure [U. The key idea 
of this framework is that time increments between samples 
are computed by using a function of previously taken sam- 
ples. Therefore, keeping sampling times (time stamps), except 
initialization times, is not necessary. The function by which 
sampling time intervals is computed is called the sampling 
function. The aim of this sampling framework is to have a 
balance between the reconstruction distortion and the average 
sampling rate. We refer to this sampling framework as Time- 
stampless Adaptive Nonuniform Sampling (TANS). The TANS 
concept can be applied on continuous- or discrete-time signals, 
and the design and analysis can be based on deterministic or 
stochastic models. 

The TANS framework is described in general terms in Sec- 
tion Section |III] then formalizes the problem setup for this 
paper, where we focus on discrete-time signals and stochastic 
models. A greedy method is developed in Section |IV] and a 
method based on dynamic programming (DP) is developed 
in Section [V] Simulations results are provided in Section [Vll 
and some proofs are relegated to Section IVIII Section IVIIII 
concludes the paper 

II. TANS Framework 

In this section, we introduce the TANS framework and 
sketch design approaches that will be developed in more detail 
in later sections. Fix some nonnegative integer m and suppose 
the ith sample of signal X{t) is taken at time ti. We take the 
[i + l)st sample after a time increment of 

T, = / {{{t,,X{t,)) : z - m + 1 < J < i}) , 
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where / is called the sampling function. This makes the 
sampling rate adapt to local characteristics of the signal. Since 
the time increment is a function of the m most recently taken 
samples, we say the order of the sampUng function / is m. 
The sampling is nonuniform except in the trivial cases when / 
is a constant-valued function (e.g., m = 0). Some initialization 
of the first m sampling times is necessary, but the effect of 
this initialization on the rate is amortized. 

The sampling function is known at the reconstruction side. 
Assuming that the state 

St^ = {{tj,X{tj)) ■.i-m + l<j<i} 

is also known at the reconstruction side when reconstructing 
X{t) on [ti, ti+i], there is no need for the sampling times (time 
stamps) to be transmitted. These times can be computed by 
using the sampling function and previously taken samples: 



t 



i+l 



= u 



fiSu). 



This type of synchronization in an adaptive system without ex- 
plicit communication is often called backward adaptation |9|. 
In a practical setting involving both sampling and quantization, 
backward adaptivity requires using the quantized values to 
drive the adaptation [101-[13|. Here, to maintain focus on 
sampling rate and adaptation of sampling increments, we do 
not explicitly include quantization effects. Note that while the 
sampling time selection is causal, the reconstruction method 
can be causal or non-causal. 

The aim of TANS is to balance between the average sam- 
pling rate and the reconstruction distortion. This objective is 
different from the one considered in change point analysis [14| 
or active learning ifTSl . There, the objective is to find points of 
the signal at which statistical behaviors of the signal change, 
by causal or non-causal sampling, respectively. 

Suppose X{t) is the reconstructed signal computed by some 
reconstruction method. For the case of discrete time and 
a stochastic signal model, define d{Sti,Ti) as the expected 
reconstruction distortion over samples from time ti + 1 until 
time — 1. That is. 



diSt„T,)^E, 



X 



D{X{t),X{t)) 

.t=ti+i 



where X is the known probabilistic model of the signal 
X{t) and D{X(t), X{t)) represents the distortion at time 
Note that at times ti and ti+i the reconstruction distortion 
is zero since exact sample values are known at these times. 
In realistic cases and for a given state St^, d{Sti,Ti) is an 
increasing function with respect to Ti, because the greater the 
next sampling step, the greater the reconstruction distortion. 
On the other hand, the greater the next sampling step, the 
larger the rate benefit. Hence, a rate penalty can be defined 
as a{Sti,Ti) — p/f{Sti) = p/Ti, where p is a rate award 
parameter. We define the cost of each sampling state as the sum 
of the expected reconstruction distortion and the rate penalty, 
that is, c{St,,T) = d{St„T^) + a{St„T^). The overall cost 

' An analogous formulation for continuous time would replace the sum with 
an integral over t G [ti, Without a stochastic model, a maximum error 

criterion could be used. 
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Fig. 2. Demonstrating the behavior of different parts of sampling state cost 
c{St^ ,Ti) = d{Sti > Ti) + a{Sti > ^i)- where d(5t . , T^) is the reconstruction 
distortion and a{Sti,Ti) is the rate penalty function. 



of the sampling process is the sum of different sampling state 
costs, that is, E»c(5t,,T,). 

Finding an appropriate sampling function for TANS depends 
on requirements such as average sampling rate, maximum 
distortion, etc. In this paper, we investigate two general 
approaches to computing appropriate sampling functions for 
given sampling setups: greedy methods and dynamic program- 
ming (DP) methods. 

In greedy methods, a sampling function at state S'f . chooses 
the next sampling increment Ti to minimize the sampling state 
cost c{Sti , Ti). As depicted in Figure|2] for any given state St^, 
d{Sti,Ti) is an increasing function of T, while a{Sti,Ti) is 
a decreasing function. Therefore, there is a trade-off between 
the sampling rate and the expected reconstruction distortion. 
A greedy method balances this trade-off by choosing the state 
cost minimizer as the next sampling increment. In certain 
cases, greedy sampling schemes can perform closely to an 
optimal scheme. 

Since c{Sti,Ti) depends on the current sampling state St^, 
a greedy sampling function does not take into account charac- 
teristics of the next sampling state. Intuitively, the larger the 
sampling increment Ti at the sampling state St^, the lower the 
quality of the next sampling state. Hence, in general, greedy 
methods are not optimal sampling schemes considering the 
overall sampling cost c(5't. ,Ti) as a comparison measure. 
We consider effects of the next sampling states' quality in 
DP methods. We show that an exact Bellman-Ford equation 
(BFE) can be written and solved for some sampling setups. 
For those cases, the solution of BFE provides an optimal 
sampling function which minimizes the overall sampling cost. 
In cases where solving the BFE is not practically feasible 
(because the number of possible sampling states is large or 
exact sampling states are not known), we propose sampling 
functions based on approximate dynamic programming (ADP) 
algorithms. Sampling functions derived by greedy methods 
can be used in ADP-based sampling methods. In fact, greedy 
methods can be viewed as DP-based methods with a unit time 
horizon. 

In this paper, we consider two examples of stochastic 
signals: stationary signals and Markovian signals. Unlike 
stationary signals, Markovian processes have sudden changes 
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in their statistical properties based on an underlying hidden 
Markov chain. Note that the general sampling framework can 
be applied on other signal models. 

III. Problem Setup, Signal Models and 
Background Results 

In this section, we first present the problem setup of TANS. 
Then, we introduce signal models considered in this paper At 
the end of this section, a generalized linear prediction filter 
is proposed, which linearly predicts the future samples of a 
stationary process by using a set of nonuniform samples from 
its past. 

A. Problem Setup 

Consider a discrete-time signal X{t). TANS with order m 
is used to take samples from this signal. Hence, the next 
sampling increment at time ti is a function of m most recently 
taken samples at that time (i.e., Ti = /(S'tJ). The set of 
samples taken using the sampling function /(•) is denoted by 
^x(t)' ^ function g{-) is used to reconstruct the original signal 
from its samples, that is, X{t) = g{r(^^^^). The reconsti'uction 

error signal is e{t) = X{t) — X{t). The overall sampling cost 
is the sum of sampling state costs over different states, that is, 
Ctot{X{t)J,g)^J:^c{Su,T,). 

A system optimization problem under the TANS framework 
can be stated as follows: 

Definition 1 (Optimal TANS sampling problem): For a 
class of signals X{t), a given reconstruction function g{-), 
and an order m, a sampling function /*(•) is desired to 
minimize the overall expected sampling cost: 

/* = arginf ctot(/,.9)- (1) 
/ 

The resulting cost is denoted c^^j = Ctot(/*,ff)- 

B. Signal Models 

In this paper, we consider the following signal models: 

• Case 1: an autoregressive signal with memory of one (i.e., 
AR(1)): 

X{t + l)=aX{t) + Z{t + l), (2) 

where Z{t + 1) is a Gaussian noise with zero mean. If 
the power of the signal is assumed to be one, the noise 
variance is 1 — a^. 

• Case 2: a Markovian signal: 

X{t + l)^ag,X{t) + Zg^{t + l), (3) 

where 6t represents the state of a hidden Markov chain 
(MC) with state transition probabilities depicted in Fig- 
ure|3] At time t, if the MC is at state 0, Ot = 0; otherwise, 
9t = 1- Depending on the value of 9t, the signal is 
generated by a first-order AR model with parameter ag^ 
and the noise variance 1 — a^^ . Note that, in this model, 
unlike the previous case, the coefficient of the AR model 
has a sudden change in time depending on the state of 
the underlying hidden Markov chain. 




Pio 



Fig. 3. A hidden Markov chain considered in Markovian signal model of 
equation ^3)- 

C. Generalized Linear Prediction Filter 

Suppose X{t) is a stationary signal. Assume we have m 
samples of X{t) at times . . . , t,}. Our aim is to 

linearly predict X{t), where t = ti + T for some T > 1, by 
using these known samples so that the expected mean square 
error is minimized (MMSE predictor). 

Consider X{t) as a predicted value of X{t) by using these 
m sample values. The prediction error is e(t) = X{t) — X{t). 
Define Tk = t — ti^k for < fc < m — 1. We want to find 
optimal linear prediction weights Wrg, ■ • ■ , Wr^_-^ so that the 
prediction error power is minimized: 

min £^[|e(t)n (4) 

m — 1 

subject to X{t) ~ WrkX{ti^k)- 

A solution of this linear optimization is referred as w*., for 
< i < m — 1. Note that, unlike a regular linear prediction 
filter ( |fT6|), a generalized linear prediction filter predicts X{t) 
by using a set of nonuniform samples. 

The auto-correlation function of X{t) can be written as 

r{i)^E[X{t)X%t-i)], (5) 

where X'^(i) represents the complex conjugate of X{t). In this 
paper, we deal with real signals. 

To simplify notations, we define the following matrices: 

T = [ro, . . . ,T,„_i]^ (6) 
P = [r{-To),...,r{~Tm-i)]'^ 

Also, a m X m auto-correlation matrix R is defined whose 
component in the ith row and jth column is r{Ti — Tj). 

The following theorem provides optimal weights for the 
generalized linear prediction filter: 

Theorem 2: 

p = Rw;. 

Proof: See Section lylTAl ■ 
For X{t) with zero mean, the variance of the prediction 
error is defined as follows: 

al,{Su,T)^E[\e*{m. 
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The following theorem provides a way to compute the vari- 
ance of the prediction error for a generalized linear prediction 
filter: 

Theorems: al,{Su, T) ^ r (0) 
Proof: See Section lylTB] 



IV. Greedy TANS 

In this section, we investigate greedy sampling functions 
various signal models. In Section [Vl we evaluate performance 
of these schemes and compare them with some other schemes, 
including uniform sampling setups. In all of these sampling 
schemes, the reconstruction function is assumed to be a 
generalized linear prediction filter, introduced in Section lTlI-CI 
Note that it is a causal reconstruction function. 

In greedy methods, a sampling function is computed as 
follows; 



Ti = argmin c(5t;,T) 

T 

subject to T > 1, 



(7) 



where /(S'tJ = T^. The sampling function of a greedy method 
depends on the current sampling state and does not take into 
account characteristics of the next sampling states. Intuitively, 
the larger the sampling increment Ti at the sampling state St^, 
the lower the quality of the next sampling state. Therefore, we 
have a trade-off between the sampling rate award of the current 
state and the sampling cost of the next state (Figure|2l). Hence, 
greedy methods usually are not optimal solutions of a TANS 
sampling problem presented in Definition [T] However, these 
greedy solutions can be used to approximate optimal solutions, 
which may have high computational complexity. 

Here, we investigate the greedy sampling function of (|7]l for 
two signal classes described in Section ITlI-BI We analyze their 
sampling rate versus the expected reconstruction distortion in 
Theorems |4] and [8] Simulation results of proposed schemes 
are shown in Section [Vll 

A. Greedy TANS for Autoregressive Signals 

In this section, we consider greedy sampling functions for 
an AR(1) signal model described in (|2]i. 

Suppose /gi.ccdy('S'ti) is an optimal greedy sampHng func- 
tion. Also, suppose c?^Qj is the expected reconstruction dis- 
tortion per sample corresponding to this sampling function. 
The following theorem introduces an optimal greedy sampling 
function and its expected reconstruction distortion per sample. 

Theorem 4: For an AR(1) signal with parameter a de- 
scribed in (|2]i, over a large enough time interval [0, Ttot], an 
optimal greedy sampling function is 



greedy 



(8) 



T*-l 



P/T 



2j 



where T* = arg min^- 

Proof: See Section IVlfel ■ 
Note that, for this signal model, an optimal greedy sampling 
function yields uniform sampling with the sampling rate 1 /T*, 



- 



which performs closely to an optimal sampling scheme. In the 
following corollary, we present a formula to compute T*. 

Corollary 5: Suppose Tioot is a solution of the following 
equation: 



T(T+ 1) 







(9) 



where < a < 1 and p > 0. If (1 - a^) < f , then 

(i) Jroot is unique; and 

(ii) T* = [TrootJ or T* = [TrootJ + 1. 

Proof: See Section [ylTDl ■ 
We will validate this by simulation in Section |Vl] Also, 
arguments of Theorem |4] can be extended for a general 
stationary signal. 

For an AR signal, since statistical properties of the signal 
do not vary in time, there is no rate adaption with respect to 
sample values. This is not the case for Markovian signals. We 
investigate the greedy TANS framework for Markovian signals 
in the next section; a rate adaption with respect to sample 
values would be helpful to minimize the sampling cost and 
leads to a nonuniform sampling scheme. 



B. Greedy TANS for Markovian Signals 

Consider a Markovian signal described by ([3]), where 6t 
represents the state of a hidden underlying Markov chain 
depicted in Figure [3] In this section, for simplicity we assume 
the MC is symmetric (i.e., poi — Pio)- However, all arguments 
can be extended for a general MC. We also assume that uq 
and ai are known. However, the state of the Markov chain 
(i.e., 9t) is unknown and needed to be estimated by using the 
taken samples. We use a generalized linear prediction filter for 
the reconstruction. Note that if ao and ai are also unknowns, 
one can learn these parameters at the beginning of the process 
by taking more samples. Then, our proposed scheme can be 
applied for the rest of the process. 

Extending the previous notation, define 63^. as the state 
of the MC during the sampling state St^- If during St^ the 
MC state stays at zero, 63^. = 0. Similarly, if the MC 
state stays at one, 63^. = 1. Otherwise, if there is an MC 
transition within this sampling state, 63^ = 2. We assume 
that 63^ is unknown and needs to be estimated by using 
the taken samples. The estimated value of 63^, is referred 
by 6*5^ . The error probability of this estimation is referred by 

P,{Su) ^ Pr{§3,^ ^03, J. 

Algorithm 6: A greedy sampling function for the consid- 
ered Markovian signal has the following steps: 

« Step i,0: Compute §3^^ and Pe(5'tJ. 

• Step i,l: Compute = arg min^ c(S't., Tj^gj ), where 
c{Sti,T\63^ ) is the sampling state cost given 63^ (see 
([IB and Cll)). 

• Step i,2: Take a sample at time ti + Ti. 

• Step i,3: Compute S't^+i. Repeat. 

For simplicity, we assume that the sampling increment T is 
small enough that the probability of having more than one MC 
transition is negligible. In other words, we have the following 
assumptions. 
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Assumption 7: 



maxTi < Tup, 

i 



Pi 



(T„p-1) 



If MC transition probabilities poi and pio are small enough, 
this assumption is reasonable. Under this assumption, the 
sampling state cost, c{Sti,T), can be conditioned on the value 
of 6 as follows: 



c{St„T\0s,. =0) 



T-l 



j=0 1=1 



+ (T-l-j>Lx} 
+Pr{§s. ^9sJ<J, 



2 ^P. 

max 



T 



T-l 



{l-Pe{Su)){j2pioPoiY.(^-al^) 



1=1 



+ (T-l-j>L.} 

+ Pe(5t.)(r-lKa.+ |, 



(10) 



which says that, if the estimation is correct (with probability 
1 — Pe {Sti )) and an MC transition happens at time ti+ j + 1 
(with probability PqqPqi), the sampling state cost is J2i=i 1 ^ 
ttg' + (T — 1 —j)cr'^^^^, where cr^^ax '^he maximum prediction 
error variance (in this example, cr^ax = !)■ the estimation 
process fails (with probability (5*4 . )), a maximum prediction 
error variance Un^ax occurs. By using the assumption p^^'' ^ 
1/2, equation ( fTOl i can be simplified as follows; 



2 P 
7 — 

max 



T 



The sampling state cost function conditioned on 6*5^ = 1 
(i.e., c{Sti,T I ^Sj. = 1)) can be written similarly. 

Finally, for the case 63^. — 2, we assume that the prediction 
variance is the maximum prediction error variance 



72 : 

max 



ciSu,T\ 



"St, 



^2) = {T-l)a, 



2 

max 



p_ 

T' 



(12) 



We analyze the performance of the proposed greedy sam- 
pling scheme in the following theorems. We derive upper 
and lower bounds for average sampling rate and expected 
reconstruction distortion per sample of the proposed sampling 
scheme. Simulation results for this sampling scheme are 
presented in Section |VI] which support the derived analytical 
bounds. 

Before presenting theorems, we introduce some notations. 
Suppose that, for all St„ Pi°* < PeiStJ < Pe"^- By 



considering an upper bound on Pe(S'f J, we define 
Tr = argmin (1 - P^"P) J] (1 - 

T 



1 = 1 



P;P(T-l)a, 



+ — (13) 

max V^-^/ 



Tq^ is defined similarly by considering a lower bound on 
Pe(S'fJ. Analogously, r"P and Tl""" can be defined. 

Also, cIq^, an upper bound on the expected reconstruction 
distortion per sample given 63^. = is defined as follows: 



E 

e=i 



Quantities do"™, d"^ and d'f'^ are defined similarly. 

The following theorem provides analytical upper and lower 
bounds on the average sampling rate and the expected recon- 
struction distortion of the greedy sampling scheme introduced 
in Algorithm |6l 

Theorem 8: Consider a Markovian signal defined in (O 
over a large enough time interval [0,Ttot]- Under Assump- 
tion |7] an achievable rate-distortion pair (P., D) of the greedy 
sampling scheme of Algorithm |6] can be bounded as follows: 

^ ^ < R < ^ir + Trir- (14) 



2t;p ' 2r7 



2^1ow 



2r{' 



low 



^ + ^< D <^ + ^. (15) 
2 2- -2 2 ^ ' 



Proof: See Section lylTEl ■ 

Similarly to Corollary |5l T,]™, T^, TI°^ and r"P can be 
calculated by finding roots of some equations. For example: 

Corollary 9: Suppose Tioot is a solution of the following 
equation: 

(l-P,"P)(l-af) + P;P(r-lVLx- y(/^^^ =0 (16) 
where < a < 1 and p > 0. If (1 - Pe"P)(l - a^) < f , then. 



(11) (i) Troot is unique; and 



(ii) T^°- = [T,oot\ or ri°- 



[TrootJ + 1. 



A similar corollary can be stated for Tq^p, rj°" and T"p. 

In Theorem [8] the performance of the proposed sampling 
scheme (i.e., its average sampling rate and the expected 
reconstruction distortion) is bounded. However, it is insightful 
to compare its performance to a genie-aided sampling scheme 
where the state of the underlying Markov chain is known. For 
a genie-aided scheme, Pg°^ — P^^ = 0, and therefore upper 
and lower bounds of Theorem [8] match: 

Corollary 10: For a genie-aided sampling scheme, the fol- 
lowing rate-distortion pair is achievable: 

R - ^r-r + ^T-r, (17) 



D 



22igcnic 

igonic 
"0 I 



2Pf 

^gcnie 



(18) 
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where 

T-l 

T§ =argniin2^(l-ao ) + -, 

1=1 

and 2^8°"'° and df°"'° are defined similarly. 

The proof of Corollary|9]is similar to the one of Corollary|5] 
Also, Corollary [TOl can be derived by using P^°" = P^p = 
in Theorem [8] 

In the first step of this sampling function, we need to 
estimate 63^.. by using m most recently taken samples of the 
signal and compute its probability of error Pe{Ost.)- In the 
following, we give an example of such a scheme. 

Example 11: Suppose m = 2 and we use a maximum 
likelihood estimator. If 6*5^ = 0, the probability distribution 
of X{ti) is N{al'-'X{U^i),l - af'-'), where J\f{n,a^) 
represents a Gaussian distribution with mean ^ and variance 
(T^. The prior probability of this event is Pqo ■ Similarly, 
if dst^ = 1, X{U) is disti-ibuted by JV{af-'X{U^i),l - 
•^1^' )■ The prior probability of this event is p^l^^ ■ Other- 
wise, the distribution of X(ti) is A/'(a^a^'"^ X{ti^i),l — 
of/ al^'^''^^ "')) with the prior probability ("^^^^)pooPn"^ ''' 
for 1 < j < Ti^i — 1. Each of these events corresponds to the 
case 9 St. =2 (i.e., there is a transition within the sampling 
state.). Therefore, by having the observed value of X{ti) and 
using these distributions, a maximum likelihood estimator can 
estimate 6*5^ and compute its error probability. 

Remarks: 

1) If the state space is not large, computations can be per- 
formed off-line and results can be used in the sampling 
function. 

2) When the order of the sampling function is large 
and/or autocorrelation coefficients change continuously, 
a maximum likelihood estimator may not be practically 
interesting. In these cases, we can use previously taken 
samples within a window of size W from the last sample 
(i.e., all taken samples from time ti — W + 1 to the 
time ti) to update or estimate autocorrelation coefficients 
to use in the sampling function. The quality of this 
estimation process depends on the window size W, the 
variation rate of autocorrelation coefficients, and the 
technique used. Two possible methods for estimating 
autocorrelation coefficients are as follows: 

< A gradient-based method. Suppose at the sampling 
state S't;_j, the set of estimated autocorrelation 
coefficients is {f(j) : j > 1}. By taking a sample at 
time rii, these coefficients are updated as follows: 

Hj) + l{X{U)X{U - 3) - f{j)) (19) 

for all possible j's, where := represents an update 
sign, and 7 > is a gradient step size. This 
gradient-based update method can be useful when 
W is not large. 

< A window-based method. If the window size W is 



large and there are enough known samples within 
the window, an empirical value for each autocorre- 
lation coefficient can be computed. 
The estimated autocorrelation coefficients {f{j) : j > 
0} can be used in the generalized linear prediction filter 
in order to design a sampling function similar to the one 
of Algorithmic] 

V. Dynamic Programming-Based TANS 

In greedy TANS, sampling functions are derived based 
on minimizing the sampling cost at each sampling state. 
Hence, it does not take into account the quality of next 
sampling states. Intuitively, the larger the sampling increment 
at the sampling state St^, the lower the quality of the next 
sampling state. Therefore, in general, greedy methods may not 
provide optimal sampling functions with respect to the overall 
sampling cost. 

We consider quality of next sampling states in dynamic 
programming-based TANS methods. For the sampling state 
Sti, a cost-to-go function J/(S'f J is defined as follows: 

JfiSu) = c(5t.,T,)+/3c(5t,+i,T,+i) + --- (20) 

In this setup, < /3 < 1 is called a discount factor. An 
interpretation of this factor is that the cost of the next sampling 
state is less important for the current state policy by a factor 
of f3. Another interpretation of this factor is that the process 
may be ended at each sampling state with probability 1 — /3. 
An optimal cost-to-go function J(S'tJ of the sampling state 
St^ is defined as follows: 

J{Su) = miJf{Su). (21) 

A Bellman-Ford equation (BFE) can be written for this 
problem by using cost-to-go functions of different sampling 
states as follows: 

JiSu) = ini {c{Su,T,) + pE[J{Su^,)]). (22) 

A solution of this Bellman-Ford equation (BFE) is an optimal 
solution for the sampling problem presented in Definition [T] 
when the reconstruction function is causal. We investigate this 
problem for various signal models and sampling setups in this 
section. For some cases where the number of sampling states 
is not large, an optimal sampling function can be derived. 
In other cases where finding this optimal solution is com- 
putationally difficult, we propose sampling functions based 
on approximate dynamic programming (ADP) algorithms. We 
define a quality function (?(>5'tJ for each sampling state St^- A 
greedy solution is used to define this quality function. Then, 
a sampling function can be computed as follows: 

T, = arginf c{Su.T) + l3E[q{Su^,)] (23) 

T 

subject to T > 1. 

In this setup, we consider quality effects of just one sampling 
state ahead. Also, note that the solution of the BFE is optimal 
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if the reconstruction function g{-) is causal. It is a necessary 
assumption of the dynamic programming setup to have a 
separation of sampling state costs at different stages. 



A. An Online Source Coding Scheme Based on TANS 

In this section, we consider a sampling problem where an 
exact DP-based solution can be derived as a solution of the 
BFE (|22] |. Consider a Markovian signal with an underlying 
hidden Markov chain depicted in Figure |3] For simplicity, 
suppose poi = eo ^ 1 and pio = ei <C 1. Other regimes 
of transition probabilities can be analyzed similarly. Suppose 
X{t) is a binary signal generated by this underlying hidden 
Markov chain so that, at state 0, X{t) = 0, and at state 1, 
X{t) = 1. For the reconstruction, we use a causal function that 
selects a most-probable binary sequence to fill missing places. 
Hamming distance is used as an error measure. We call this 
problem an online source coding problem since it does not 
have a compression delay as in an unconstrained block source 
coding scheme. In this problem, TANS can provide some 
compression gain without having any delay. This problem can 
be extended to Markov chains with more states and different 
regimes of transition probabilities. 

In this sampling problem, there are two different sampling 
states: If X{ti) = 0, then, St^ = 0; otherwise, St^ = 1. 
Suppose /(•) is the sampling function (i.e., Ti = f{Sti)-)- 
Suppose f{St, = 0) = To and f{Sf^ = 1) = Ti. Hence,' Tq 
and Ti represent sampling steps at different sampling states. 

Since eo,ei <^ 1, the reconstruction method would choose 
all-0 and all-1 sequences when St^ — and St^ — 1, respec- 
tively. Suppose Sti — 0. Hence, the next sample is taken at 
time ti+To. An error happens if there is one or more Markov 
chain transitions over the time interval [ti + l,ti + Tq — 1]. 
To simplify our analysis, we only consider first-order error 
terms (i.e., we assume at most one transition happens over a 
time interval [ti + l,ti + Tq — 1]). To have this simplifying 
assumption, we need to restrict the sampling increments such 
that maxTojTi <C min{^, ^}. Therefore, the probability of 
having more than one Markov chain transition over a time 
interval of a length Tq or Ti is negligible. 

By considering Hamming distance as a distortion measure, 
the sampling state cost at St^ = can be written as follows: 

To-l 

c{Su,To) = V {1 - e,y-h,{n - j) + ^. (24) 

Note that, (1 — eo)"'^^eo is the probability of not having 
a transition over the first j — 1 samples of the sampling 
interval and having a transition at the jth sample. Hence, 
the considered reconstruction method makes errors on samples 
from time ti+j to i^+To — 1, which corresponds to a Hamming 
distance of To ^ j- A similar argument can be made for the 
case of = 1. 

By considering these sampling state cost functions, the BFE 



can be written as follows: 

J{Su^Q) = min [c(5i, =0,ro) 

J-a-.-i-i 

+/3(l-eo)^V(5t, =0) 
+/3(l-(l-eor°)J(5t. =1)], 
J{Su=l) = min[c(5i, =l,ri) 

+/3(1 - = 1) 

+/3(l-(l-eifi)J(5i, =0)]. 

Since these BFEs have only two variables, various numerical 
and analytical methods can be applied to find their solution, 
which in turn corresponds to an optimal sampling scheme in 
TANS (e.g., see |17J). Simulation results for this sampling 
scheme are given in Section |VT] 

B. Approximate Dynamic Programming Methods 

Finding a solution of the BFE of a DP-based sampling 
function may not be practically feasible if the sampling state 
space is large or is unknown (or is known partially). In these 
cases, approximate dynamic programming (ADP) algorithms 
can provide suboptimal solutions with a reasonable computa- 
tion complexity. In this section, we investigate an ADP-based 
sampling function of TANS introduced in (l23T l. To use this 
approximate algorithm, for any sampUng state S'*-, a quality 
measure q{Sti) is assigned. We use a greedy sampling solution 
to define this quality function. 

Consider a Markovian signal X{t) described in Sec- 
tion IIII-BI A greedy sampling function for this signal is 
introduced in Algorithm |6] We refer to this greedy sampling 

function as /S'^dy^.)^ ^j^gj.^ ^.greedy ^ ^greedy (5.^ 

A quality function of each state is defined as 

qiSu)^lTr''' (25) 

where 7 is a scaling parameter Intuitively, the larger the 
greedy sampling step, the higher the quality of the sampling 
state. Therefore, an ADP-based sampling function of TANS 
can be derived by using the optimization setup of ( l23T l. 

For computing the expected quality of the next sampling 
state, transition probabilities among different sampling states 
should be known. Suppose, at a sampling state St^, the next 
sample is taken after a time interval Ti. The value of the 
sample X{ti+Ti) is a random variable with a mean X{ti+Ti), 
which can be computed by a generalized linear prediction 
filter The probability distribution of X{ti + Ti) determines 
the probability distribution of the next sampling state St^^i- 
Therefore, the expected value of the quality function of the 
next state can be computed by using this probability distri- 
bution. However, to simplify this sampling function further, 
one may approximate this expected quality by the quality of 
the most probable next state, which has a sample value of 
X{ti + Ti) at time t^+i. We call this state S**^,,.!. Therefore, a 
more simplified sampling function based on an ADP algorithm 
can be written as 

Ti = argmin c(S't, , T) + J. 

T 



8 



FEIZI, GOYAL, AND MEDARD 



+ 

+ 



Rate award parameter p 



Fig. 4. Illustration of Corollary |5] by simulation. The red curve is the solution 
of (9). The blue curve is optimal sampling rate T* . Note that, their difference 
is always less than 1. 



For any given state St^ and Ti — T, 9(5'f._|_j) can be 
computed. Intuitively, the term f3q{St^^-^) is a correction term 
for the greedy solution considering the quality of the next 
sampling state. In this scheme, the quality of only one future 
sampling state is considered. However, one can extend this 
algorithm to consider the quality of more than one future 
sampling state. 

Algorithm 12: An approximate dynamic programming- 
based sampling function for the Markovian signal of (O can 
be summarized as follows: 

• Step i,0: Compute §3^^, PeiStJ and q{St^^J. 

. Step i,l: Compute Ti = mm c{St^,T\§s, J + l3q{St,^,). 

• Step i,2: Take a sample at time ti + Ti. 

• Step i,3: Compute S't^+i. Repeat. 

Simulation results for this sampling procedure are presented 
in Section [Vl] 

VI. Simulation Results 

In this section, we evaluate the performance of the pro- 
posed sampling schemes by simulations and compare their 
performance against uniform sampling. In uniform sampling, 
the sampling rate is always in the form of i? = l/Tuni, 
, where Tuni is a positive integer To be able to compare 
the performance of different methods with uniform sampling 
at different rates, we modify the uniform sampling setup to 
capture all possible sampling rates. To do this, for a given 
rate R = 1 /Tuni where Tuni is not an integer number, the ith 
sample is taken at time ti = round(Tuni)- 

First, we consider an autoregressive signal model introduced 
in (|2]l. In Theorem HI we show that an optimal sampling 
scheme for this signal is uniform. Also, Corollary |5] provides 
a straightforward way to compute the optimal sampling rate. 
Figure|4]illustrates this corollary for the case of a = 0.99 (i.e., 
noise power is 0.02) and for different p values. As shown in 
this figure, the difference between the root of equation Q and 
the optimal increment T* is always less than 1. 

Now, we consider Markovian signals introduced in (O. 
Transition probabilities of the underlying MC are assumed 
to be 0.001. We demonstrate the performance of different 
sampling methods on a rate-distortion plots (Figures |5] |6] 
and I?). Rate refers to average sampling rate and distortion 
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Average Distortion 
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Fig. 5. Analytical rate-distortion curves of Theorem [8] for a Markovian 
signal of equation (5) with the signal parameters ao = 0.01 and ai = 
0.99. Pe represents en'or probability of the estimation process. Pe = curve 
con'esponds to the so-called genie-aided scheme. 
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Fig. 6. Average sampling rate versus average reconstruction distortion for a 
Markovian signal with parameters ao = 0.01 and ai = 0.99 for methods: 
(i) uniform sampling with causal line-connecting (CLC) reconstruction, (ii) 
uniform sampling with non-causal line-connecting (NCLC) reconstruction, 
(iii) uniform samphng with generalized hnear prediction (GLP) filtering (iv) 
greedy TANS with generahzed linear prediction (GLP) filtering, and (v) 
analytical lower bound for greedy TANS based on Theorem [s] 



refers to average reconstruction distortion per sample. Lower 
curves in these plots indicate better performance. 

Figure |5] shows analytical rate-distortion curves of Theo- 
rem [8] for a Markovian signal of equation (O for different 
estimation error probabilities. Here, noise power in state of 
the Markov chain is 0.05 (i.e., ao ~ 0.97) and the noise power 
in state 1 is 0.5 (i.e., ai « 0.7). As illustrated in this plot, the 
lower the error probability, the better the performance. Note 
that the case where = is referred as a genie-aided scheme. 

Figures |6] and [7] show rate-distortion curves achieved by 
simulations for various schemes. The signal model is Marko- 
vian ([3]) with parameters ao = 0.01 and ai = 0.99 in schemes 
considered in Figure |6] and, ao — 0.7 and ai — 0.97 in the 
ones of Figure [T] In greedy schemes, we use a maximum 
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Fig. 7. Average sampling rate versus average reconstruction distortion for 
a Markovian signal with parameters ag = 0.7 and ai = 0.97 for methods: 
(i) uniform sampling with causal line-connecting (CLC) reconstruction, (ii) 
uniform sampling with non-causal line-connecting (NCLC) reconstruction, 
(iii) uniform sampling with generalized linear prediction (GLP) filtering (iv) 
greedy TANS with generalized linear prediction (GLP) filtering, and (v) 
analytical lower bound for greedy TANS based on Theorem [s] 



likelihood estimation block with to = 10 to estimate the state 
of the underlying Markov chain. For greedy TANS, we use 
generalized linear prediction (GLP) filter as the reconstruction 
method. Note that, this reconstruction is causal. For uniform 
sampling, we use three reconstruction methods: causal line- 
connecting (CLC), non-causal Une-connecting (NCLC) and 
GLP filtering. Although, it is not fair to compare performance 
of greedy TANS with causal reconstruction with a uniform 
sampling scheme with non-causal reconstruction, in the case 
of Figure |6l greedy TANS outperforms uniform sampling 
schemes including the one with a non-causal reconstruction 
method. In the case of Figure |2l greedy TANS outperforms 
uniform schemes with causal reconstructions. In the low- 
distortion regime (distortion less than 0.08), it also outper- 
forms uniform sampling with non-causal reconstruction. 

As illustrated in these figures, genie-aided greedy TANS 
provides an analytical lower bound for greedy TANS in the 
rate-distortion plot. The proposed greedy TANS performs 
closely to this lower bound. Also, by choosing the estimation 
error probability 0.05 (estimated from simulations), an analyti- 
cal upper bound for greedy TANS can be achieved as proposed 
in Theorem [S] Moreover, by comparing two greedy TANS 
schemes of these figures, we notice that the more different 
a values of MC states, the more gain is provided by TANS 
framework. 

Figure[8] shows the performance of a dynamic programming- 
based TANS scheme for an online source coding application 
explained in Section IV-AI Here, we assume eg = 0.1 and 
ei — 0.01. To solve the Bellman-Ford equation dZSb . a value- 
iteration method is used |17|. As shown in this plot, a DP- 
based TANS scheme outperforms uniform sampling. 

Figure |9] illustrates performance of a TANS scheme based 
on approximate dynamic programming for a Markovian signal 
explained in Algorithm [TJ] Here, we assume that underlying 



Fig. 8. A rate-distortion plot of a dynamic programming-based TANS for 
an online source coding application explained in Section IV-AI 
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Fig. 9. Comparison of a dynamic programming-based TANS with greedy 
TANS for a Markovian signal model. 



Markov chain transition probabiUties are 0. 1 (i.e., poi — pio — 
0.1). The signal parameters are assumed to be ao — 0.7 and 
ai — 0.99. As illustrated in this figure, a TANS scheme based 
on ADP outperforms the greedy one. 

VII. Proofs 

A. Proof of Theorem |2] 

To find a solution of the optimization problem we use 
similar techniques as for the regular linear prediction filter 
fT6i . Note that in an optimal scheme the error term should be 
orthogonal to all known samples: 

E[X{U^k)e*{t)\=Q (26) 

for fc = 0, ...,TO - 1, where e*{t) X(t) - 

By using ( |26] | and Q, optimal weights w* , for < i < 



10 



FEIZI, GOYAL, AND MEDARD 



TO — 1 should satisfy the following set of linear equations: 



Now, consider the right hand side (RHS) of the BFE ( l22l i: 



(27) 



RHS of BFE = 



i=0 



for fc = 0,...,TO — 1. By using matrix notations of (|6|l, linear 
equations of ( |27] ) can be written as follows: 

p = Rw;. 



B. Proof of Theorem |i] 

Since X{t) has zero mean and X{t) = X{t) + e*{t), by 
using ( |26l ), we have 

al , T) = a| - 4 = r (0) - 4 , (28) 
where X{t) = (w*)^(Xs,J. Therefore, 

4 = 



= (w;)^i?[(Xsj(Xsj^]w: 
= (w;)^Rw; 



T * 



(29) 



Equations ( |28l l and ( |29] l establish the theorem. 



C. Proof of Theorem |4] 

By using the definition of an AR(1) signal, the sampling 
state cost can be written as follows: 



T-l 



(30) 



Hence, an optimal greedy sampling solution for this sampling 
state cost can be computed as follows: 



T-l 

Tg^^^dy = arg min ^ (1 - ) + ^ . (3 1) 



Now, we show that this optimal greedy TANS (which is 
a uniform sampling scheme) performs closely to an optimal 
sampling scheme, which may be nonuniform. Consider a uni- 
form sampling state St^ with an inter-state sampling increment 
^*iccdy The sampling state cost of this uniform sampling 
scheme is referred to by Cmin (this sampling state cost can be 
achieved by having Tgrocdy ^ solution of the optimization 
setup First, we show that this uniform sampling scheme 
satisfies the BFE ( |22] |. Since under this sampling scheme, 
Sti+i = Sti, therefore, the cost-to-go function at the sampling 
state can be written as follows: 

Jf{Su) = c{Su,T,)+(ic{Su^„T,+^) + --- 
= c(5t.,T;„„dy)(l + /3 + /3' + ---) 

(32) 



1-/3 



^min 

Jf{Su)- 



1-/3 



(33) 



Therefore, the above uniform sampling scheme satisfies the 
BFE when 5t. (i.e., the initialization state) happens to be a 
uniform sampling state with an inter-state sampling step size 
^*iccdy '^he process starts from another sampling state, we 
assume that there is always a way to reach to this uniform 
sampling state (e.g., take to uniform samples with a sampling 
increment T*^^^^^). Moreover, the mapping functions between 
policies and costs are continuous. Hence, a small difference 
in costs due to initialization effects has a small effect in 
sampling policies. Therefore, for an AR(1) signal model, the 
optimal greedy TANS (which is uniform) performs closely to 
an optimal sampling scheme (which may be nonuniform). 



p/(r(T+l)) forTe [1,00) 



D. Proof of Corollary \5\ 

Define h{T) ^ (1 
Note that h{T) is a continuous function over this interval. 
Since = (1 - a^) - p/2 < and limT^^o h{T) = 1 > 0, 
by using the mean value theorem, h has a root Troot in [1, 00) ■ 
Since dh{T)/dT > 0, this root is unique, completing the proof 
of part (i). 

For an AR(1) signal, we have: 

h{T) = c{Su,T + l) - c{St,,T) ^ (1 - a'^) - ^^^\^y 

Since T* = argmin^ c(S't. ,T) and also T* is an integer, 
either T* = L^'rootJ or T* = [TrootJ + 1, completing the 
proof of part (ii). 

E. Proof of Theorem \8\ 

Under conditions of ( fTOb . (fTTI) shows the sampling state cost 
given 9 St = 0. By using the sampling scheme proposed in 
Algorithmic! we have: 



T,^(irgmmc{St,,T\9s,. = 0). 

T 



(34) 



Since > 1, by having P^™ < P(S'tJ < P^p, given 



9 St. = 0, we have: Tf 



low 



Similarly, one can 



show that, given Os^^ = 1, T'"* < T, < T"p. Since the 
underlying Markov chain is symmetric (i.e., pqi ~ Pio), in 
steady state, 63^. = approximately half of the time and 
dst. = 1 approximately half of the time. Hence, the number 
of samples taken in that state by using the proposed sampling 
scheme of Algorithm |6] is bounded between 

Ttot , Ttot J Ttot , Ttot 



and 



22^iow 2^^^^ 2T^P ' 

This demonstrates the sampling rate bounds. By using bounds 
on Tj and Pe(S't. ) in ( fTTI ). deriving bounds on the expected 
reconstruction distortion is straightforward. 
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VIII. Conclusion 

In this paper, we introduced a new framework for an 
adaptive nonuniform sampling scheme called time-stampless 
adaptive nonuniform sampling (TANS). The key idea of 
this framework is that, time increments between samples are 
computed by using a function of previously taken samples. 
Therefore, keeping sampling times (time stamps), except ini- 
tialization times, is not necessary. We introduced two meth- 
ods to design sampling functions for discrete-time stochastic 
signals: a greedy method, and a method based on dynamic 
programming. We analyzed the performances of these schemes 
by computing (or bounding) their trade-offs between sampling 
rate and expected reconstruction distortion for autoregressive 
and Markovian signals. We showed that, by being time- 
stampless and opportunistically adapting to local signal char- 
acteristics, TANS can provide significant rate-distortion gains, 
which can be translated to improved power efficiency in some 
applications. 
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